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Abstract 

A computer-aided detection (CAD) system for the identification of 
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lung internal nodules in low-dose multi-detector helical Computed To- 
mography (CT) images was developed in the framework of the MAGIC- 
5 project. The three modules of our lung CAD system, a segmenta- 
tion algorithm for lung internal region identification, a multi-scale dot- 
enhancement filter for nodule candidate selection and a multi-scale neural 
technique for false positive finding reduction, are described. The results 
obtained on a dataset of low-dose and thin-slice CT scans are shown in 
terms of free response receiver operating characteristic (FROC) curves 
and discussed. 

keywords: Computed Tomography (CT), Computer- Aided 
Detection (CAD) 

1 INTRODUCTION 

Lung cancer is the leading cause of cancer-related mortality in developed 
countries [1,2]. Only 10 -15% of all men and women diagnosed with lung 
cancer live five years after diagnosis [2, 3] and no significant improvement 
has occurred in the lEist 20 years [4]. Early-stage cancer is asymptomatic, 
so more than 70% of patients diagnosed with lung cancer are in the ad- 
vanced stages of the disease, when it's too late for effective treatments [5]. 
However the five-year survival rate for people who are diagnosed with 
early-stage lung cancer (stage I) can reach 70% [6]. 

In this scenario, the implementation of screening programs for the asymp- 
tomatic high-risk population is an approach that is being tried to reduce 
the mortality rate of lung cancer. It was proved that screcrnng pro- 
grams with X-ray radiography don't lead to a reduction of the mortality 
rate [7-10], due to the low sensitivity of this technique in the identification 
of small, early-stage cancers. 

Lung cancer most commonly manifests itself with the formation of non- 
calcified pulmonary nodules. Computed Tomography (CT) is proved to be 
the best imaging modality for the detection of small pulmonary nodules, 
particularly since the introduction of the multi-detector-row and helical 
CT technologies [11-13]. 

Therefore CT-based screening programs are regarded as a promising tech- 
nique for detecting small, early-stage lung cancers [14,15]. In CT-based 
screening protocols, low-dose settings are required, since the examined 
population is asymptomatic, and therefore potentially healthy. However, 
low-dose images are noisier than standard-dose ones, so it's more difficult 
to identify small nodules when low-dose settings are used. Moreover, the 
amount of data that need to be interpreted in CT examinations can be 
very large, especially in screening programs, when a thin slice thickness 
is usually used, thus generating up to about 300 two-dimensional images 
per scan. Computer- Aided Detection (CAD) could support radiologists 
in the identification of small, early-stage pathological objects in screening 
CT scans. 

The MAGIC-5 project [16] aims at developing CAD software systems 
for Medical Applications on distributed databases by means of a GRID 

Infrastructure Connection approach [17]. In particular, MAGIC-5 re- 
searchers work on mammographic images for breast cancer detection. 
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NMR-SPECT-PET images for the diagnosis of the Alzheimer disease 
and Computed Tomography images for lung cancer identification. The 
CAD system we propose for small pulmonary internal nodule identifica- 
tion was developed and validated on a set of images acquired from the 
Pisa centre of the First Italian Randomized Controlled Trial (ITALUNG- 
CT) , recently started in order to study the potential impact of screening 
on a high-risk population using low-dose helical CT [18,19]. 
The CAD system is a three steps procedure: a segmentation algorithm 
identifies the lung internal region, then a multi-scale dot-enhancement fil- 
ter provides a list of nodule candidates and finally a multi-scale neural 
network-based classification module reduces the number of false positive 
findings per scan. 



2 THE LUNG CT DATABASE 

A low-dose lung CT database was acquired from the Pisa centre of the 
ITALUNG-CT trial, the First Italian Randomized ControUed Trial for 
the screening of lung cancer [18,19]. The CT scans are acquired with a 
4-slice spiral CT scanner according to a low-dose protocol (screening set- 
ting: 140 kV, 20 mA, mean equivalent dose 0.6 mSv), with 1.25-mm slice 
coUimation. 

Each scan is a sequence of slices stored in DICOM (Digital Imaging and 
communications in Medicine) format. The reconstructed slice thickness 
is 1 mm. The average number of slices per scan is about 300 with a 
512x512 pixel matrix, a pixel size ranging from 0.53 to 0.74 mm and 
12 bit grey levels in Hounsfield units (HU). 

The pathological structures to be automatically detected by a CAD sys- 
tem are non calcified nodules. Such nodules can be divided into three 
main categories, depending on their location in the lung: internal nodules, 
fully contained in the lung parenchyma, sub-pleural nodules, originated 
inside the lung parenchyma but adjacent or connected to the pleura, and 
pleural nodules, originated from the pleura and grown toward the lung 
parenchyma. 

The identification of internal, sub-pleural and pleural nodules requires 
dedicated procedures, due to their different location and shape (see fig- 
ure [!}. Our preliminary results on sub-pleural and pleural nodule identi- 
fication can be found in [20,21]. 

In this study, only internal nodule identification is considered. According 
to ITALUNG-CT screening protocol, in the baseline CT examinations ra- 
diologists mark the nodules with a diameter greater than 5 mm, which 
can be a sign of the presence of lung cancer at an early stage, thus al- 
lowing an early diagnosis of this disease. Once identified, nodules are 
kept under control by means of follow up CT examinations, in which the 
nodules with diameters between 3 and 5 mm should also be marked, but 
only if newly formed. Only nodules with diameters greater than 8-10 mm 
are subjected to further examinations like Positron Emission Tomography 
(PET) or biopsy. 

The CT database acquired for this study is constituted by baseline exam- 
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Figure 1: Examples of small pulmonary nodules: a) internal nodule; b) sub- 
pleural nodule; c) pleural nodule. 

inations, so our goal is to develop a CAD system for the identification of 
internal nodules with diameters greater than 5 mm. Two experienced ra- 
diologists have selected the structures of interest from the so far collected 
database of 39 CT scans. This task has been carried out by means of a 
dedicated visualization and annotation tool developed in the framework 
of the MAGIC-5 collaboration. The resulting dataset consists of 75 solid 
internal nodules with diameters in the 5-12 mm range. The maximum 
value is 12 mm, but 96% of the nodules have diameters ranging from 5 to 
8 mm. Examples of solid internal nodules extracted from our dataset are 
shown in figure [2] they may have CT values in the same range of those of 
blood vessels and airway walls and may be strongly connected to them. 
Some internal ground-glass opacities had also been selected by radiol- 
ogists, but their number was too small to allow a dedicated analysis; 
therefore this type of pathological objects was excluded from our target 
list. 




Figure 2: Examples of internal small pulmonary nodules. 

3 THE CAD SYSTEM 

As explained in the Introduction, our CAD system consists in three main 
modules: first of all, the lung internal region is identified by means of a 



4 



purposely built segmentation algorithm. As a second step, nodule can- 
didates are detected using a multi-scale 3D filter enhancing spherically- 
shaped objects. Finally, a multi-scale voxel-based neural technique is 
implemented to reduce the amount of false positive findings per scan. 

3.1 Lung internal region segmentation 

A lung volume 3D segmentation algorithm was implemented according 
to the procedure proposed in [22]. First of all, to separate the low- 
intensity lung parenchyma from the high-intensity surrounding tissue (fat 
tissue and bones), the voxel intensities are thresholded at a fixed value 
(-400 HU) [22,23]. Then, in order to discard all the regions not belonging 
to the lungs, the connected lung regions for the left and right lungs are 
selected starting with a seed point inside each lung. At this stage ves- 
sels and airway walls are not included in the segmented lung. Finally, a 
combination of the erosion and dilation morphological operations, known 
as rolling-ball algorithm [24], is applied. The rolling-ball operator uses a 
spherical kernel, having the efi^ect of including in the segmented lung all 
the vessels and all the airway walls smaller than the ball size: a radius of 
10 voxels for the spherical kernel was chosen in order to include all the 
objects within our nodule dimension target. An example of the various 
stages of the lung volume segmentation algorithm is shown in figure [31 




Figure 3: Lung volume 3D segmentation algorithm (a slice of the full 3D volume 
is shown): a) algorithm input (the original CT scan); b) scan after threshold- 
ing; c) scan after selection of connected regions and rolling ball algorithm; d) 
algorithm output (segmented lung volume). 

Since our goal is internal nodule research only, the lung volume identified 
by the segmentation algorithm is eroded on the external side by a 2.5 mm- 
thick band, so as to define the lung internal region, where internal nodules 
with a diameter greater than 5 mm are expected to be found. In figure|4]an 
example of lung internal region is shown, compared with the lung volume 
identified by simply implementing the segmentation algorithm described 
above. 
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Figure 4: a) Original volume; b) lung volume obtained by the segmentation 
algorithm (outer contours) and lung internal region. 

3.2 Nodule candidate identification 

The automated nodule candidate detection should be characterized by a 
sensitivity value close to 100%, in order to avoid setting an a priori upper 
bound to the CAD system performance. To this aim, wc followed the 
approach proposed in [25]. Lung nodules arc modeled as spherical objects 
with a Gaussian profile and the 3D matrix of data is filtered with a filter 
function Zdot built to discriminate between spherical objects and objects 
with planar or elongated shapes. In particular, Zdot is defined as follows 
firom the eigenvalues of the Hessian matrix of each voxel: 



where Ai,A2, A3 are the eigenvalues of the Hessian matrix of each voxel, 
sorted so that |Ai| > IA2I > lAs]. 

To enhance the sensitivity of this filter to nodules of different sizes, a multi- 
scale approach is followed. According to the indications given in [25-27], 
the Zdot function is combined to a Gaussian smoothing at several scales 
^ min — (Tlj . . . J CT max = cTjv- Within the range [cri,o"jv], intermediate 
smoothing scales are computed as = r'~^iTi ior i = 2, . . . , N — 1, where 
r = (o-jv/cTi)^/^^^. The final filter value Zmax assigned to each voxel 
is defined as the maximum Zdot value obtained from the different scales 
(denoted Zdot{ci) for i = 1, . . . , iV), multiplied by the relative scale factor: 



Once the 3D filtered matrix is calculated, a peak-detection algorithm is 
applied to detect the local maxima, which, sorted in decreasing order, are 
the list of nodule candidates identified by the filter. 

The range [ai,aN] and the number N of smoothing scales have to be 
chosen in order to make the filter able to enhance nodules of the desired 
dimension target. 

Assuming a nodule can be denoted by a 3D Gaussian function of scale 
(T, its diameter can be reasonably assumed to be 4cr, thus accounting for 



Zdoti^l, A2, A3) 



{ 



IA3IVIA1I if Ai,A2,A3<0, 
otherwise. 



= max (7iZdot{cri). 

ie{i,...,JV} 
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Figure 5: Response of the multi-scale filter implemented in the scale range 
ai = 1.25 mm, CTAr=2 mm and N=5, to synthetic nodules modeled as 3D Gaussian 
functions with scale a varying between 0.25 mm and 5 mm. 



more than 95% of the nodule volume. Therefore implementing the multi- 
scale filter in a range [ai, (T]v] allows to enhance nodules with a diameter 
in a range [dmin = ai * i, d^ax = cr]v * 4]. 

In figure [5] the response of the filter, implemented with a"i = 1.25 mm, 
crjv=2 mm and N—5, to synthetic nodules (3D Gaussian functions with 
scale a varying between 0.25 mm and 5 mm), is shown. As expected, 
the maximum filter response is obtained for nodule diameters between 
(ji * 4=1.25*4 mm=5 mm and (tat * 4=2*4 mm=8 mm. However it can 
be noticed that the filter response remains higher than the 90% of the 
maximum value for nodule diameters in all our dimension target, between 
5 and 12 mm. Moreover, we found that 5 is the minimum value of A'^ 
required to obtain a fiat response. Therefore in this work (7i=1.25 mm, 
cr jv=2 mm and N—5 were chosen to identify nodules in our dimension 
target. 

The dot-enhancement filter implemented with these parameters was run 
on the entire lung volume identified by the segmentation algorithm pro- 
posed in [22] (see figure [3] and figure |4]). Then, from the filter output 
list, a list of internal nodule candidates was created, constituted by filter 
outputs located in the previously identified lung internal region only (see 
figure |4]). As it is explained in the next paragraph, the list contains a 
large number of false positives (FP). 
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3.3 False positive reduction 

3.3.1 The basic idea of the Multi-Scale Voxel-Based Neu- 
ral Approach (MS-VBNA) 

Most false positive findings are crossings between blood vessels. In figure[S] 
some examples of FP are shown. 



Figure 6: Some examples of false positive findings generated by the dot- 
enhancement filter. 

To reduce the number of FP/scan, we developed a procedure called Multi- 
Scale Voxel-Based Neural Approach (MS-VBNA). 

First of all, a region of interest (ROT) is defined from each internal nodule 
candidate of the filter output list as the set of the voxels v belonging to a 
sphere of radius 5 pixels around the voxel identified by the filter and with 
intensity value 7„ above a relative threshold t 



ROIs are so defined in order to include voxels of the structures of interest 
(nodules or FP as, for example, blood vessel crossings) and not back- 
ground voxels. In order not to have in a ROI only voxels belonging to 
calcific structures that could be present in the sphere, ROIs are identified 
only once all the voxels with /„ > 200 HU are set to 200 HU. In figure [7] 
an example of ROI corresponding to an internal nodule of diameter 5.5 
mm is shown. 

At this stage we consider a nodule candidate, and consequently its ROI, as 
corresponding to a nodule (in other words, we consider a nodule found by 
the filter) if the voxel identified by the filter lies within a sphere centered 
on the nodule and having diameter equal to the nodule dimension. 

The basic idea of the MS-VBNA is to associate to each voxel of a ROI a 
feature vector constituted by the intensity values of its 3D neighbors, the 
three eigenvalues of the gradient matrix defined as 






? = 1,2,3, 
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Figure 7: Consecutive slices containing an internal nodule of diameter 5.5 mm 
(top) and corresponding ROI (bottom). 



where I{xi,X2,X3) is the intensity function and the sums are over the 
neighborhood area, and the three eigenvalues of the Hessian matrix de- 
fined as 



hj = 1,2,3, 



where I{xi,X2,X3) is the intensity function [28] (see figure |8}. As we 
proved in [29], using these six features, in addition to the simple voxel 
neighborhood rolled down into a vector, improves the system discrimina- 
tion capability. 
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Figure 8: Basic idea of the Voxel-Based Neural Approach to false positive reduc- 
tion: each voxel is characterized by a feature vector constituted by the intensity 
values of its 3D neighbors and the eigenvalues of the gradient and the Hessian 
matrices. 

Feature vectors are then classified by a standard three-layer feed-forward 
back-propagation neural network which is trained and tested to assign 
each voxel either to the nodule or normal tissue target class. 

3.3.2 The training and testing phase 

The network training and testing phase was carried out as follows. The 
available dataset of 39 scans containing 75 internal nodules was parti- 



9 



tioned into a teaching set of 15 scans containing 30 nodules and a valida- 
tion set of 24 scans containing 45 nodules; the partition was defined so as 
to make the teaching set representative of all the nodule dimensions. 
In figure [9] the distribution of nodule diameters in the teaching set and in 
the validation set is shown. 




Figure 9: Diameter distribution of the 30 internal nodules in the teaching 
and the 45 internal nodules in the validation set. 

Then, the 15 scans of the teaching set were analyzed and used to set the 
parameters for the training and testing phase. In particular, ROIs cor- 
responding to the 30 nodules in the teaching set were considered. We 
know that clearly the number of voxels of a ROI doesn't provide a precise 
measurement of the nodule dimension, due to the ROI definition itself. 
However the mean number of voxels increases as the nodule dimension 
increases. In fact the mean number of voxels for nodules with diameters 
until 6 mm is 46, for nodules with diameters between 6 and 7 mm is 96 
and for nodules with diameters above 7 mm is 124. 

In particular, for all nodules in the teaching set the number of voxels in 
the ROI is greater than 20; moreover, for all but one nodules with diam- 
eters above 7 mm, the number of voxels in the ROI is greater than 100 
and all but two ROIs with more than 100 voxels correspond to nodules 
greater than 7 mm. 

Following the hypothesis that the network could really learn to recognize 
a nodule if the neighborhoods used to create the feature vector for the 
ROI voxels were large enough to intersect the nodule edge, we decided 
to train and test the network using two different neighborhood sizes; in 
particular 7x7x3 (7x7 voxels for three consecutive slices) was chosen as 
small size and 13x13x5 (13x13 voxels for five consecutive slices) was 
chosen as large size. According to our hypothesis, using the small neigh- 
borhood allowed the network to recognize at the most nodules of 7 mm of 
diameter, whereas for larger nodules the larger neighborhood was surely 
necessary. 
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Therefore, using the relation described above between the number of vox- 
els in the ROIs and nodule dimensions, we decided to train and test the 
neural network using the small neighborhood size for ROIs with less than 
100 voxels, the large neighborhood size for ROIs with more than 100 vox- 
els. 

The goal of this threshold of 100 voxels is not to define a real partition 
between ROIs that could be recognized using small and large neighbor- 
hoods; it is only an indicative threshold to be used to train and test the 
network in an effective way. 

It can easily be calculated that a feature vector deriving from a 7x7x3 
neighborhood has 153 entries, 147 deriving from the neighborhood rolled 
down and 6 deriving from the gradient and the Hessian matrices. A 
feature vector deriving from a 13x13x5 neighborhood would have 851 
features. So we decided not to consider all the neighborhood but only a 
down-sampled part, as shown in figure 1101 In this way a feature vector 
deriving from a 13x13x5 neighborhood has also 153 entries. As a con- 
sequence, we trained and tested a feed- forward back-propagation neural 
network with 153 input nodes and two output nodes. 
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Figure 10: Feature vectors deriving from two different neighborhood sizes: in 
the first case (top), the 7x7x3 neighborhood is rolled down into a vector of 
147 entries; in the second case (bottom), the 13x13x5 neighborhood is down- 
sampled and rolled down to obtain a vector of 147 features, too. In both cases, 
the 6 features deriving from the gradient and the Hessian matrices are added. 
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The training and testing phase was performed according to the 5x2 cross 
validation method [30] . This method consists in performing 5 repUcations 
of the 2-fold cross validation method. In each replication, the teaching set 
is randomly partitioned into two sets {Ai and Bi for i = 1, . . . , 5) with an 
almost equal number of entries. The learning algorithm is trained on each 
set and tested on the other one. The results achieved in each trial for the 
correct classification of individual voxels are reported in table[T] where the 
sensitivity and the specificity values obtained on the test sets and on the 
whole teaching set in the ten trials are shown. Since the performance of a 
classifier and the comparison among different classifiers are conveniently 
evaluated in terms of the area Az under the ROC curve [31], we reported 
in table [1] also the estimated areas under the ROC curves obtained in 
each trial. The average and the standard deviation of A^ obtained in the 
10 trials on testing and teaching sets are reported too, thus showing the 
effectiveness and the robustness of the neural classifier performance. 



Teaching set 


On 


testing set 




On 


teaching set 




Train 


Test 


Sens. (%) 


Spec. (%) 


A. 


Sens. (%) 


Spec. (%) 


A. 


Ai 


Bi 


74.1 


79.7 


0.850 


85.8 


83.0 


0.921 


Bi 


Ai 


79.0 


82.6 


0.890 


87.3 


85.5 


0.936 


A2 


B2 


75.4 


85.9 


0.884 


83.3 


88.8 


0.927 


B2 


A2 


78.5 


80.0 


0.858 


86.9 


82.4 


0.916 


^3 


B3 


73.2 


85.9 


0.873 


82.3 


87.4 


0.919 


S3 


A3 


79.4 


79.9 


0.872 


85.6 


82.7 


0.916 


^4 


Bi 


73.7 


84.4 


0.868 


82.5 


86.8 


0.915 


Bi 


Ai 


74.0 


82.3 


0.857 


85.6 


85.9 


0.924 


A5 


B5 


77.6 


79.3 


0.865 


84.8 


81.0 


0.907 


B5 


A5 


74.3 


87.1 


0.888 


83.5 


89.3 


0.932 


Average 






0.871 






0.921 


Std deviation 






0.014 






0.008 



Table 1: Evaluation of the performance of the standard back-propagation learn- 
ing algorithm for the neural classifier according to the 5x2 cross validation 
method. 

Among the networks with a similar performance on test sets, the second 
one in table[T]was more balanced with respect to sensitivity and specificity 
on the test set and achieved the best performance on the teaching set. 
Moreover it was the network with the largest area under the ROC curve, so 
it was expected to be the one with the greatest discrimination capability. 
Therefore this trained neural network was applied to the ROIs voxels. 

3.3.3 The application of the trained neural network: from 
voxels to ROIs 

To be sure to make the approach sensible to all our nodule dimension 
target, we decided to apply the trained network at two different scales, 
characterized by two different thresholds on the number of voxels in the 
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ROI and feature vectors deriving from the two different size voxel neigfi- 
borhoods. 

In particular, for the first scale only ROIs with more than 20 voxels are 
processed and for each voxel the network is applied to the feature vector 
deriving from the 7x7x3 neighborhood. 

For the second scale, only ROIs with more than 50 voxels are processed and 
for each voxel the network is applied to both the feature vectors that can 
be associated to the voxel, that is to say the feature vector deriving from 
the 7x7x3 neighborhood, obtaining the output outi — {outi^i, outi^2), 
and the feature vector deriving from the 13x13x5 neighborhood, obtain- 
ing the output out2 — {out2,i,out2.2)- Then the final output assigned to 
the voxel for the second scale is outi if \outi,i — outi,2 \ > \0ut2.1 — out2.2\, 
out2 otherwise. 

This choice of the two thresholds of 20 and 50 voxels is done only depend- 
ing on the analysis of the teaching set, as well as the parameters used 
in the training and testing phase: as explained in the subsection 13.3.21 
no nodule in the teaching set has less than 20 voxels in its correspond- 
ing ROI. Moreover, nodules that could not be recognized using only the 
7x7x3 neighborhood have more than 50 voxels. In both cases, the choice 
of the threshold is conservative and it is expected not to compromise the 
system capability to generalize. 

At each scale a nodule candidate is then classified as "CAD nodule" if the 
percentage of voxels in its ROI tagged as "nodule" by the neural classifier 
is above a threshold. By varying these thresholds at the two scales a free 
response receiver operating characteristic (FROG) curve can be evaluated. 



4 RESULTS 

In figure [TT] the distribution of the positions of the 30 internal nodules of 
the teaching set in the lists of internal nodule candidates provided by the 
filter is shown. 

As it can be noticed, the lists provided by the dot-enhancement filter for 
the 15 scans of teaching set have to be truncated at 140 to include all 
annotated nodules. We fixed this parameter and we evaluated the filter 
sensitivity on the validation set of 24 scans containing 45 nodules, by 
truncating the lists at different values around it, in particular we truncated 
the lists at a value M, for M = 80, 100, 120, 140, 160, 180, 200. The 
results are shown in tabled 

We then applied the MS-VBNA to the same truncated lists and we eval- 
uated the performance of our CAD system in terms of FROC curves. In 
figure 1121 the FROC curves obtained on the validation set by truncat- 
ing the lists at AI — 80, 140, 200 are shown. It can be noticed that 
the maximum sensitivity achieved by the CAD system is clearly different 
for different values of M, due to the different filter sensitivity. However, 
FROC curves are very close to each other up to a very good sensitivity 
(85% range), thus proving the robustness of the system. In particular, a 
sensitivity of 86.7% at 5.4-7.6 FP/scan and a sensitivity of 84.4% at 4.1- 
5.8 FP/scan are measured. In other words, the MS-VBNA maintains a 
very good sensitivity (86.7%) by eliminating 91-96% of dot-enhancement 
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nodule position in filter output lists 



Figure 11: Distribution of the positions of the 30 internal nodules of the teaching 
set in the lists provided by the filter. 



Lists liuiical(Hl at 


Filti^r s(>usiti\"it\' (%) 


80 


88.9 


100 


91.1 


120 


91.1 


140 


93.3 


160 


95.6 


180 


95.6 


200 


95.6 



Table 2: Dot-enhancement filter sensitivity. 



filter false positive findings. For the slightly lower sensitivity of 84.4%, 
the false positive reduction rate of the MS-VBNA is 94-97%. 

5 CONCLUSIONS 

A CAD system for the identification of internal nodules with diameters 
greater than 5 mm was developed. The three basic modules of the sys- 
tem are described and the results obtained on a validation datasct of 24 
low-dose CT scans with 1-mm reconstructed slice thickness containing 45 
internal nodules are presented. A sensitivity of 86.7% was obtained at a 
low level of false positive findings (5.4-7.6 FP/scan); the sensitivity re- 
mains high (84.4%) even at 4.1-5.8 FP/scan. In particular, the procedure 
wc developed for false positive reduction (MS-VBNA) works in a very 
satisfactory way: it eliminates more than 90% of dot-enhancement filter 
false positive findings and maintains a very good sensitivity. 
The results obtained so far are promising, but further work is foreseen. 
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Figure 12: FROC curves on the validation set of 24 scans containing 45 internal 
nodules. 



The basic concept of CAD is to provide a second opinion to assist radi- 
ologists' image interpretation [32]. Many studies [33-37] investigated and 
proved the CAD ability to improve the performance of radiologists in lung 
nodule detection in screening and clinical CT examininations. 
For example, in [33] Brochu et al. compared the performance achieved 
by three radiologists with different levels of experience in detecting lung 
nodules in 30 screening CT examinations, without the assistance of CAD 
and using a CAD system as second reader. The CAD system used in the 
study was the commercial system ImageChecker@V1.0 (R2 Technology). 
The CAD alone achieved a sensitivity of 79% in the detection of nodules 
measuring 4 mm or larger, at a rate of 3.5 FP per examination. The 
sensitivity of the three radiologists in the detection of the same nodules 
before and after using the CAD system as second reader varied from a 
mean of 59% to a mean of 90%, with a gain of 31%. 

In the next phase of our work we plan to evaluate the effect of our CAD 
system as second reader on the performance of radiologists with different 
levels of experience. 

Moreover, a validation of our CAD system against a larger database is 
required. To this aim, a larger database, not only of baseline but also of 
repeat and follow up examinations is being collected. 
As explained in section [2] according to ITALUNG-CT screening protocol, 
in follow up examinations the nodules with diameters in the 3-5 mm range 
should also be identified; therefore we intend to adapt our procedure, by 
adjusting the parameters at every step, such as the band thickness in 
lung internal region segmentation, the multi-scale filter range in nodule 
candidate identification and the voxel neighborhood size in false-positive 
reduction, to this new dimension target, in order to develop a CAD system 
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useful in every phase of a screening program, on the basis of radiologists' 
requests. 
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